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(57) Abstract 



The present invention relates to novel SH3 domain binding protein, referred to herein as a DEP polypeptides. The DEF polypeptides 
comprise several motifs including a sre SH3 consensus binding sequence, four ankyrin repeats, one zinc finger domain and six copies of 
a proline-rich tandem repeat DEF polypeptides may function as mediators of SH3 domain-dependent signal transduction pathways and, 
thus may mediate multiple signaling events such as cellular gene expression, cytoskeletal architecture, protein trafficking and endocytosis, 
cell adhesion, migration, proliferation and differentiation. Described herein are isolated and antisense nucleic acids molecules, recombinant 
expression vectors, host cells and non-human transgenic animals containing an insertion or a disruption of the DEP gene. Diagnostic, 
screening and therapeutic methods utilizing the compositions of (he invention are also provided. 
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Met Glu Gly Glu Val lie Val Val Thr Gly Glu Glu Asp Gin Glu Trp 
1090 1095 1100 

Trp He Gly His He Glu Gly Gin Pro Glu Arg Lys Gly Val Phe Pro 
5 1105 1110 1115 1120 

Val Ser Phe Val His He Leu Ser Asp 
1125 

10 (2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4382 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



35 



(ii) MOLECULE TYPE: cDNA 



( ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 351.. 3803 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GACAAAAGCT GGAGCTCGCG CGCCTGCAGG TCGACACTAG TGGATCCAAA GAATTCGGCA 60 
30 CGAGCTCCGG CCCCCTCCAA ACTCACATGC CGGACTCCCG CTTCCTGTCC AGCAGCTCCA 120 
GATGGGGCAG ATCAATGCGC GCATTCCTGC TCATTGTAAC TGTAGCGGCA TGTGATTTCA 180 
GCCCGTAATG TCCGCGCGCT GGACGGAGCA CAATGCGCTG AATATGGTGC CACTCGGAAA 240 
CACGGAGCTG TACGCACAAT CTGCTTTGCA ATTACTTTTT AATCTGTTAA TACGGAGTGA 300 



AACCGCAGCT GTCTCGCTCA GGGTTGTTTT GCTGAGGTGA CTACAGAGCC ATG AGG 356 

Met Arg 

40 i 

TCC TCG TCC TCG CGT TTG TCA AGT TTT TCC TCC AGG GAT TCA TTA TGG 404 
Ser Ser Ser Ser Arg Leu Ser Ser Phe Ser Ser Arg Asp Ser Leu Trp 
5 10 15 

45 

AGT CGG ATG CCG GAT CAG ATC TCC GTG TCC GAG TTT CTC TCG GAG ACG 452 
Ser Arg Met Pro Asp Gin He Ser Val Ser Glu Phe Leu Ser Glu Thr 
20 25 30 

50 ACG GAG GAT TAC AAT TCC CCC ACG ACC TCG AGC TTC ACC ACC CGC CTG 500 
Thr Glu Asp Tyr Asn Ser Pro Thr Thr Ser Ser Phe Thr Thr Arg Leu 
35 40 45 50 
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CAG AGC TGC CGG AAC ACG GTC AAT GTT CTG GAA GAG GCT TTG GAT CAG 548 
Gin Ser Cys Arg Asn Thr Val Asn Vai Leu Glu Glu Ala Lea Asp Gin 
55 60 65 

GAC CGA ACT GCT TTA CAG AAG GTC AAG AAA TCT GTC AAA GCA ATC TAC 596 
Asp Arg Thr Ala Leu Gin Lys Val Lys Lys Ser Val Lys Ala lie Tyr 
70 75 80 

AAC TCG GGT CAA GAA CAT GTG CAG AAT GAA GAG AAT TAT GGA CAG GCA 644 
Asn Ser Gly Gin Glu His Val Gin Asn Glu Glu Asn Tyr Gly Gin Ala 
85 90 95 

CTG GAC AAG TTT GGC AGC AAC TTC ATC AGC CGA GAT AAC TCT GAT CTG 692 
Leu Asp Lys Phe Gly Ser Asn Phe lie Ser Arg Asp Asn Ser Asp Leu 
100 105 HO 

GGA ACA GCC TTC ATC AAG TTT TCT GGA CTT ATC AAA GAG CTG GCT GCT 740 
Gly Thr Ala Phe He Lys Phe Ser Gly Leu He Lys Glu Leu Ala Ala 
115 120 125 130 

CTC CTC AAG AAC CTG CTC CAG AGC CTC AGC CAC AAC GTC ATC TTC ACC 788 
Leu Leu Lys Asn Leu Leu Gin Ser Leu Ser His Asn Val He Phe Thr 
135 140 145 

CTG GAC TCT CTG CTC AAA GGA GAT CTA AAG GGA GTG AAG GGG GAC CTT 836 
Leu Asp Ser Leu Leu Lys Gly Asp Leu Lys Gly Val Lys Gly Asp Leu 
150 155 160 

AAA AAG CCT TTC GAC AAG GCC TGG AAA GAC TAT GAA ACC AAG TTC ACA 884 
Lys Lys Pro Phe Asp Lys Ala Trp Lys Asp Tyr Glu Thr Lys Phe Thr 
165 170 175 

AAG ATC GAG AAG GAG AAG AGA GAA CAT GCC AAG CAG CAC GGC ATG ATC 932 
Lys He Glu Lys Glu Lys Arg Glu His Ala Lys Gin His Gly Met He 
180 185 190 

CGC ACA GAA ATC ACC GGC GCA GAG ATT GCA GAA GAG ATG GAG AAG GAG 980 
Arg Thr Glu He Thr Gly Ala Glu He Ala Glu Glu Met Glu Lys Glu 
195 200 205 210 

CGG AGG ATC TTT CAG CTG CAG ATG TGT GAG TAC CTG ATC AAA GTC AAT 1028 
Arg Arg He Phe Gin Leu Gin Met Cys Glu Tyr Leu He Lys Val Asn 
215 220 225 

GAG ATT AAG ACC AAG AAG GGA GTG GAT CTC CTC CAG AAT CTC ATC AAG 1076 
Glu He Lys Thr Lys Lys Gly Val Asp Leu Leu Gin Asn Leu He Lys 
230 235 240 

TAT TAT CAT GCA CAG TGC AAT TTC TTC CAG GAT GGC TTG AAA ACT GCT 1124 
Tyr Tyr His Ala Gin Cys Asn Phe Phe Gin Asp Gly Leu Lys Thr Ala 
245 250 255 

GAC AAG TTG AAG CAG TAT ATT GAA AAA TTA GCA GCT GAT CTT TAT AAT 1172 
Asp Lys Leu Lys Gin Tyr He Glu Lys Leu Ala Ala Asp Leu Tyr Asn 
260 265 270 
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ATA AAA CAG ACT CAG GAT GAG GAG AAA AAA CAG CTC ACA GCT CTC AGA 1220 
lie Lys Gin Thr Gin Asp Glu Glu Lys Lys Gin Leu Thr Ala Leu Arg 
275 280 285 290 

GAC CTC ATC AAA TCT TCC TTA CAG CTG GAC CAG AAG GAG GAT TCT CAG 1268 
Asp Leu lie Lys Ser Ser Leu Gin Leu Asp Gin Lys Glu Asp Ser Gin 
295 300 305 

AGT AAG CAG AGC GGG TAC AGC ATG CAC CAG CTG CAG GGC AAT AAG GAG 1316 
Ser Lys Gin Ser Gly Tyr Ser Met His Gin Leu Gin Gly Asn Lys Glu 
310 315 320 

TTT GGC AGT GAG AAG AAG GGC TAT CTC TTC AAG AAG AGT GAT GGG ATC 1364 
Phe Gly Ser Glu Lys Lys Gly Tyr Leu Phe Lys Lys Ser Asp Gly lie 
325 330 335 

CGT AAG GTG TGG CAG AGG AGG AAG TGC TCA GTG AAA AAT GGC ATC CTC 1412 
Arg Lys Val Trp Gin Arg Arg Lys Cys Ser Val Lys Asn Gly lie Leu 
340 345 350 

ACC ATC TCT CAT GCC ACA TCC AAC AGG CAG CCG GTG AGA CTG AAT CTG 1460 
Thr lie Ser His Ala Thr Ser Asn Arg Gin Pro Val Arg Leu Asn Leu 
355 360 365 370 

CTG ACC TGC CAG GTT AAA CCC AGT GGA GAG GAT AAG AAG TGC TTT GAC 1508 
Leu Thr Cys Gin Val Lys Pro Ser Gly Glu Asp Lys Lys Cys Phe Asp 
375 380 385 

CTC ATC TCT CAT AAT CGA ACA TAT CAT TTC CAG GCA GAG GAC GAA CAG 1556 
Leu He Ser His Asn Arg Thr Tyr His Phe Gin Ala Glu Asp Glu Gin 
390 395 400 

GAG TTT GTG ATA TGG ATC TCG GTG CTG ACT AAT AGT AAG GAG GAG GCT 1604 
Glu Phe Val He Trp He Ser Val Leu Thr Asn Ser Lys Glu Glu Ala 
405 410 415 

CTG AAC ATG GCA TTT CGT GGG GAG CAG AGT GCT GGA GAT GAC AGT TTG 1652 
Leu Asn Met Ala Phe Arg Gly Glu Gin Ser Ala Gly Asp Asp Ser Leu 
420 425 430 

GAG GAC TTG ACC AAA GCC ATC ATC GAG GAC GTG CTG CGC ATT CCT GGA 1700 
Glu Asp Leu Thr Lys Ala He He Glu Asp Val Leu Arg He Pro Gly 
435 440 445 450 

AAC GAA GTC TGC TGT GAC TGT GGG GTT CCA GAG CCC AAA TGG TTA TCC 1748 
Asn Glu Val Cys Cys Asp Cys Gly Val Pro Glu Pro Lys Trp Leu Ser 
455 460 465 

ACT AAC CTC GGC ATC CTG ACG TGC ATC GAG TGT TCA GGA ATC CAC AGG 1796 
Thr Asn Leu Gly He Leu Thr Cys lie Glu Cys Ser Gly He His Arg 
470 475 480 
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GAA ATG GGA GTC CAT ATT TCG CGC ATC CAA TCC ATG GAG CTT GAC AAA 1844 
Glu Met Gly Val His lie Ser Arg lie Gin Ser Met Glu Leu Asp Lys 
485 490 495 

CTT GGA ACC TCT GAA CTC TTG CTG GCT AAG AAC GTG GGC AAC AGT AGT 1892 
Leu Gly Thr Ser Glu Leu Leu Leu Ala Lys Asn Val Gly Asn Ser Ser 
500 505 510 

TTC AAC GAA ATA TTA GAA GGG AAT CTG CCG AGT CCT TCA CCA AAG CCA 1940 
Phe Asn Glu lie Leu Glu Gly Asn Leu Pro Ser Pro Ser Pro Lys Pro 
515 520 525 530 

GCG CCA TCA AGT GAC ATG ACC GAG AGG AAG GAG TAC ATC AAT GCG AAG 1988 
Ala Pro Ser Ser Asp Met Thr Glu Arg Lys Glu Tyr lie Asn Ala Lys 
535 540 545 

TAC GTG GAG CAC AGG TTC GCT CGG CGA ACG GCC ACT ACA GCC ACA GCC 2036 
Tyr Val Glu His Arg Phe Ala Arg Arg Thr Ala Thr Thr Ala Thr Ala 
550 555 560 

AGA CAG GGC GAC TTG TAC GAG GCG GTG AGA ACG CGA GAC TTG ATG GCT 2084 
Arg Gin Gly Asp Leu Tyr Glu Ala Val Arg Thr Arg Asp Leu Met Ala 
565 570 575 

CTC ATT CAG CTC TAT GCA GAT GGA GTG GAG CTA ATG GAT CCT TTC CCA 2132 
Leu He Gin Leu Tyr Ala Asp Gly Val Glu Leu Met Asp Pro Phe Pro 
580 585 590 

GAA GCA GGA CAG GAC CCG GGA GAG ACA GCT CTG CAC TTT GCT GTT CGG 2180 
Glu Ala Gly Gin Asp Pro Gly Glu Thr Ala Leu His Phe Ala Val Arg 
595 600 605 610 



ACA TCA GAC CAG ACT TCC CTG CAC CTG GTG GAC TTT CTT GTC CAA AAC 2228 
Thr Ser Asp Gin Thr Ser Leu His Leu Val Asp Phe Leu Val Gin Asn 
615 620 625 

AGT GGG ACT CTA GAC AGA CAG ACG GAG AGT GGA AAC GCT GCT CTC CAT 2276 
Ser Gly Thr Leu Asp Arg Gin Thr Glu Ser Gly Asn Ala Ala Leu His 
630 635 640 

TAC TGC TGC ACA TAT GAG AAG CCA GAG TGT CTC AAA CTG CTG CTC AGG 2324 
Tyr Cys Cys Thr Tyr Glu Lys Pro Glu Cys Leu Lys Leu Leu Leu Arg 
645 650 655 

GGA AAA CCG TCT ATT GAC CTG GTT AAT CAA AAC GGG GAG ACA GCA TTG 2372 
Gly Lys Pro Ser He Asp Leu Val Asn Gin Asn Gly Glu Thr Ala Leu 
660 665 670 



GAT ATC GCC AGA CGA CTG AGA AAT GTA CAG TGT GAA GAG CTA CTG GTG 
Asp He Ala Arg Arg Leu Arg Asn Val Gin Cys Glu Glu Leu Leu Val 
675 680 685 690 



2420 
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GAG GCA GCA GCC GGG AGG TTT AAT CCT CAT GTG CAT GTG GAG TAT GAG 2468 
Glu Ala Ala Ala Gly Arg Phe Asn Pro His Val His Val Glu Tyr Giu 
695 700 705 

TGG AAT CTG CGG CTG GAG GAG ATT GAT GAG AGT GAC GAT GAC CTG GAT 2516 
Trp Asn Leu Arg Leu Glu Glu lie Asp Glu Ser Asp Asp Asp Leu Asp 
710 715 720 

GAC AAG CCT AGT CCA GTG AAG AAG GAG CGT TCT CCT CGT CCT CAG AGC 2564 
Asp Lys Pro Ser Pro Val Lys Lys Glu Arg Ser Pro Arg Pro Gin Ser 
725 730 735 

TTC TGT CAT TCG TCC AGC GTG TCT CCT CAG GAG AAG TTA ACC CTG CCG 2612 
Phe Cys His Ser Ser Ser Val Ser Pro Gin Glu Lys Leu Thr Leu Pro 
740 745 750 

GGG TAT CTA GGA CAC AGG GAC AAG CAG AGA CTG TCC TAT GGA GCC TTT 2660 
Gly Tyr Leu Gly His Arg Asp Lys Gin Arg Leu Ser Tyr Gly Ala Phe 
755 760 765 770 

GCC AAC CCC GTC TAC AGC ACC TCC ACC GAA ACC CCT GCA TCT CCA GTG 2708 
Ala Asn Pro Val Tyr Ser Thr Ser Thr Glu Thr Pro Ala Ser Pro Val 
775 780 785 

TCA GAG GGA CCC ACC ATA GCC AGC AAG ACC CCT GCA AAA GCT CCG TCC 2756 
Ser Glu Gly Pro Thr lie Ala Ser Lys Thr Pro Ala Lys Ala Pro Ser 
790 795 800 

TGT GGG CCG CCC ACC TCT CTG CCG CTG GGA TCT CAA TCG AGT GCA GGA 2804 
Cys Gly Pro Pro Thr Ser Leu Pro Leu Gly Ser Gin Ser Ser Ala Gly 
805 810 815 

GGC AGC TCC ACT TTG TCT AAG AAG AGA GCT CCT CCT CCA CCT CCC GGA 2852 
Gly Ser Ser Thr Leu Ser Lys Lys Arg Ala Pro Pro Pro Pro Pro Gly 
820 825 830 

CAC AAG CGC ACC CAC TCA GAT CCC CCC AGT CCC GTA CTG CAG GGT CCG 2900 
His Lys Arg Thr His Ser Asp Pro Pro Ser Pro Val Leu Gin Gly Pro 
835 840 845 850 

CAG AGC AAA GGA AGT GAG TCC ACA CCT CCT TCT GCA AAT CGG ACA TCC 2948 
Gin Ser Lys Gly Ser Glu Ser Thr Pro Pro Ser Ala Asn Arg Thr Ser 
855 860 865 

CCG GCC AAC AAG TTT GAG GGA ATC CAG CAG CAG CAA AGC ACT ACG TCT 2996 
Pro Ala Asn Lys Phe Glu Gly lie Gin Gin Gin Gin Ser Thr Thr Ser 
870 875 880 

ATG AAC ACA AAA GCA ACA TTT GGC CCA CGA GTT CTT CCC AAA CTA CCT 3044 
Met Asn Thr Lys Ala Thr Phe Gly Pro Arg Val Leu Pro Lys Leu Pro 
885 890 895 

CAA AAA GTG GCA CTA CGA AAG ATT GAC ACA ATC CAC CTC CCA TCA GTG 3092 
Gin Lys Val Ala Leu Arg Lys lie Asp Thr lie His Leu Pro Ser Val 
900 905 910 



WO 98/36065 



PCT/US98/02724 



-122- 



GAC AAG TCT GGT CCT GAT GTG CTT CAG AAA CCC CCA CAG GCC CAG GAT 3140 
Asp Lys Ser Gly Pro Asp Val Leu Gin Lys Pro Pro Gin Ala Gin Asp 
915 920 925 930 

GCA CCT CCC ACC AGA GCC TCA GAT ACA ATA ACC AGA CCC ACT GAA CCT 3188 
Ala Pro Pro Thr Arg Ala Ser Asp Thr lie Thr Arg Pro Thr Glu Pro 
935 940 945 

CCA CCT AAA ATT CCA CAG GTC GCA GAA CGA TCC CAG CCT GTG GAT GTC 3236 
Pro Pro Lys He Pro Gin Val Ala Glu Arg Ser Gin Pro Val Asp Val 
950 955 960 

CCG CAG AAA CCG CAC ATC TCA GAC CTT CCT CCC AAA CCG CAA CTA TCA 3284 
Pro Gin Lys Pro His He Ser Asp Leu Pro Pro Lys Pro Gin Leu Ser 
965 970 975 

GAT CTT CCC CCC AAA CCC CAA TTG TCG GAT TTA CCA CCA AAA CCT CAG 3332 
Asp Leu Pro Pro Lys Pro Gin Leu Ser Asp Leu Pro Pro Lys Pro Gin 
980 985 990 

CTT TCT GAC CTG CCC CCG AAG CCT CAG CTT AAG GAT CTT CCC CCT AAG 3380 
Leu Ser Asp Leu Pro Pro Lys Pro Gin Leu Lys Asp Leu Pro Pro Lys 
995 1000 1005 1010 

CCG CAG ATC AGT GAT CTG CCA TCC AAA CCG GCC GTG TGT TCT GCG TCT 3428 
Pro Gin He Ser Asp Leu Pro Ser Lys Pro Ala Val Cys Ser Ala Ser 
1015 1020 1025 

GAG GCC ACA CAG AGG CAG TCA ACG CAG GAG GAA ACC AGT CCG AAG CCC 3476 
Glu Ala Thr Gin Arg Gin Ser Thr Gin Glu Glu Thr Ser Pro Lys Pro 
1030 1035 1040 

CAG CTG ACG GAG ACA CAG TCA TTC AGC CAG CAG GAG GAG CTC TCA CCC 3524 
Gin Leu Thr Glu Thr Gin Ser Phe Ser Gin Gin Glu Glu Leu Ser Pro 
1045 1050 1055 

CGA CAG GCC AGC GAG GAC ACC AAT GGA GCG CCC GCA GGA GCC TTG GAA 3572 
Arg Gin Ala Ser Glu Asp Thr Asn Gly Ala Pro Ala Gly Ala Leu Glu 
1060 1065 1070 

ATG CCA GTC CCA ATG CCA CGC AAA ATT AAC ACA GTA GCA AAG AAC AAA 3620 
Met Pro Val Pro Met Pro Arg Lys lie Asn Thr Val Ala Lys Asn Lys 
1075 1080 1085 1090 

GCG AAG CGT GTG AAA ACC ATC TAT GAT TGC CAG GCA GAC AAT GAC GAT 3668 
Ala Lys Arg Val Lys Thr He Tyr Asp Cys Gin Ala Asp Asn Asp Asp 
1095 1100 H05 

GAG CTG ACT TTT GTG GAG GGC GAG GTT ATA ATT GTC ACA GGA GAG GAA 3716 
Glu Leu Thr Phe Val Glu Gly Glu Val He He Val Thr Gly Glu Glu 
1110 1115 1120 
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GAC CAG GAG TGG TGG ATC GGG CAC ATA GAG GGT CAG CCT GAA AGG AAA 3764 
Asp Gin Glu Trp Trp lie Gly His lie Glu Gly Gin Pro Glu Arg Lys 
1125 1130 1135 

GGG GTC TTC CCA ATG TCC TTC GTG CAC ATT CTG TCA GAC TGACAGTGCA 3813 
Gly Val Phe Pro Met Ser Phe Val His lie Leu Ser Asp 
1140 1145 1150 

TGACCGGCAG CCGAGAGGCT CTCTAACTAG CACAAGCTCC GCTCTCTCTG GCCTCACACT 3873 

GGACTGTGGG CATTGCCTCT GTACATAGCT GCTGAAACCC AAACGGTCTC CAAACACATA 3933 

CAAAACTGAA GTATCAAACC CATGCTCCCT TAATCCTCAA GGGTGAAATG TGTAAACTAT 3993 

GTGTTGTTCA TAAACTGTGT TATCCTGCCT ACCAGTATTA TCGTAGCCAT GGCAGCCCAG 4053 

CATGCCATAA CTGGGTTTGC AGTAGCTATA CTTGGAAATC TAGCACTTAA CATGTATGCT 4113 

GTAACTTTGT GTATGTGTAC ACATATAGAA TTATATGTAT GTCCATTTTA AGTGTGTCTT 4173 

TGTACATACA TATGCACAGA CGTAAGTGTA TATTTATGTA CGTATGTATA ATGTACAAGT 4233 

GTGCAAATGT ATGTTAACCC TGCTTGCTTA TGGAGCCAGA GTGACTCTAG ACATTTTAGT 4293 

GTACTGTTTT AAAAAAAAAA AAAAAAAAAC TCGAGAGTAC TTCTAGAGCG GCCGCGGGCC 4353 

CATCGATTTT CCACCCGGGT GGGGTACCA 4382 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1151 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Arg Ser Ser Ser Ser Arg Leu Ser Ser Phe Ser Ser Arg Asp Ser 
15 10 15 

Leu Trp Ser Arg Met Pro Asp Gin He Ser Val Ser Glu Phe Leu Ser 
20 25 30 

Glu Thr Thr Glu Asp Tyr Asn Ser Pro Thr Thr Ser Ser Phe Thr Thr 
35 40 45 

Arg Leu Gin Ser Cys Arg Asn Thr Val Asn Val Leu Glu Glu Ala Leu 
50 55 60 

Asp Gin Asp Arg Thr Ala Leu Gin Lys Val Lys Lys Ser Val Lys Ala 
65 70 75 80 
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Ile Tyr Asn Ser Gly Gin Glu His Val Gin Asn Glu Glu Asn Tyr Gly 
85 SO 95 

Gin Ala Leu Asp Lys Phe Gly Ser Asn Phe lie Ser Arg Asp Asn Ser 
100 105 110 

Asp Leu Gly Thr Ala Phe lie Lys Phe Ser Gly Leu lie Lys Glu Leu 
115 120 125 

Ala Ala Leu Leu Lys Asn Leu Leu Gin Ser Leu Ser His Asn Val lie 
130 135 140 

Phe Thr Leu Asp Ser Leu Leu Lys Gly Asp Leu Lys Gly Val Lys Gly 
145 150 155 160 

Asp Leu Lys Lys Pro Phe Asp Lys Ala Trp Lys Asp Tyr Glu Thr Lys 
165 170 175 

Phe Thr Lys lie Glu Lys Glu Lys Arg Glu His Ala Lys Gin His Gly 
180 185 190 

Met He Arg Thr Glu He Thr Gly Ala Glu He Ala Glu Glu Met Glu 
195 200 205 

Lys Glu Arg Arg He Phe Gin Leu Gin Met Cys Glu Tyr Leu He Lys 
210 215 220 

Val Asn Glu He Lys Thr Lys Lys Gly Val Asp Leu Leu Gin Asn Leu 
225 230 235 240 

He Lys Tyr Tyr His Ala Gin Cys Asn Phe Phe Gin Asp Gly Leu Lys 
245 250 255 

Thr Ala Asp Lys Leu Lys Gin Tyr He Glu Lys Leu Ala Ala Asp Leu 
260 265 270 

Tyr Asn He Lys Gin Thr Gin Asp Glu Glu Lys Lys Gin Leu Thr Ala 
275 280 285 



Leu Arg Asp lieu He Lys Ser Ser 
290 295 

Ser Gin Ser Lys Gin Ser Gly Tyr 
305 310 

Lys Glu Phe Gly Ser Glu Lys Lys 
325 

Gly He Arg Lys Val Trp Gin Arg 
340 

He Leu Thr He Ser His Ala Thr 
355 360 



Leu Gin Leu Asp Gin Lys Glu Asp 
300 

Ser Met His Gin Leu Gin Gly Asn 
315 320 

Gly Tyr Leu Phe Lys Lys Ser Asp 
330 335 

Arg Lys Cys Ser Val Lys Asn Gly 
345 350 

Ser Asn Arg Gin Pro Val Arg Leu 
365 
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Asn Leu Leu Thr Cys Gin Val Lys Pro Ser Gly Glu Asp Lys Lys Cys 
370 375 380 

Phe Asp Leu lie Ser His Asn Arg Thr Tyr His Phe Gin Ala Glu Asp 
5 385 390 395 400 

Glu Gin Glu Phe Val lie Trp lie Ser Val Leu Thr Asn Ser Lys Glu 
405 410 415 

10 Glu Ala Leu Asn Met Ala Phe Arg Gly Glu Gin Ser Ala Gly Asp Asp 
420 425 430 

Ser Leu Glu Asp Leu Thr Lys Ala lie lie Glu Asp Val lieu Arg lie 
435 440 445 

15 

Pro Gly Asn Glu Val Cys Cys Asp Cys Gly Val Pro Glu Pro Lys Trp 
450 455 460 

Leu Ser Thr Asn Leu Gly lie Leu Thr Cys lie Glu Cys Ser Gly He 
20 465 470 475 480 

His Arg Glu Met Gly Val His He Ser Arg He Gin Ser Met Glu Leu 
485 490 495 

25 Asp Lys Leu Gly Thr Ser Glu Leu Leu Leu Ala Lys Asn Val Gly Asn 
500 505 510 

Ser Ser Phe Asn Glu He Leu Glu Gly Asn Leu Pro Ser Pro Ser Pro 
515 520 525 

30 

Lys Pro Ala Pro Ser Ser Asp Met Thr Glu Arg Lys Glu Tyr He Asn 
530 535 540 

Ala Lys Tyr Val Glu His Arg Phe Ala Arg Arg Thr Ala Thr Thr Ala 
35 545 550 555 560 

Thr Ala Arg Gin Gly Asp Leu Tyr Glu Ala Val Arg Thr Arg Asp Leu 
565 570 . 575 

40 Met Ala Leu He Gin Leu Tyr Ala Asp Gly Val Glu Leu Met Asp Pro 
580 585 590 

Phe Pro Glu Ala Gly Gin Asp Pro Gly Glu Thr Ala Leu His Phe Ala 
595 600 605 

45 

Val Arg Thr Ser Asp Gin Thr Ser Leu His Leu Val Asp Phe Leu Val 
610 615 620 

Gin Asn Ser Gly Thr Leu Asp Arg Gin Thr Glu Ser Gly Asn Ala Ala 
50 625 630 635 640 



Leu His Tyr Cys Cys Thr Tyr Glu Lys Pro Glu Cys Leu Lys Leu Leu 
645 650 655 
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Leu Arg Gly Lys Pro Ser He Asp Leu Val Asn Gin Asn Gly Glu Thr 
660 665 670 

Ala Leu Asp He Ala Arg Arg Leu Arg Asn Val Gin Cys Glu Glu Leu 
675 680 685 

Leu Val Glu Ala Ala Ala Gly Arg Phe Asn Pro His Val His Val Glu 
690 695 700 

Tyr Glu Trp Asn Leu Arg Leu Glu Glu He Asp Glu Ser Asp Asp Asp 
705 710 715 720 

Leu Asp Asp Lys Pro Ser Pro Val Lys Lys Glu Arg Ser Pro Arg Pro 
725 730 735 

Gin Ser Phe Cys His Ser Ser Ser Val Ser Pro Gin Glu Lys Leu Thr 
740 745 750 

Leu Pro Gly Tyr Leu Gly His Arg Asp Lys Gin Arg Leu Ser Tyr Gly 
755 760 765 

Ala Phe Ala Asn Pro Val Tyr Ser Thr Ser Thr Glu Thr Pro Ala Ser 
770 775 780 

Pro Val Ser Glu Gly Pro Thr He Ala Ser Lys Thr Pro Ala Lys Ala 
785 790 795 800 

Pro Ser Cys Gly Pro Pro Thr Ser Leu Pro Leu Gly Ser Gin Ser Ser 
805 810 815 

Ala Gly Gly Ser Ser Thr Leu Ser Lys Lys Arg Ala Pro Pro Pro Pro 
820 825 830 

Pro Gly His Lys Arg Thr His Ser Asp Pro Pro Ser Pro Val Leu Gin 
835 840 845 

Gly Pro Gin Ser Lys Gly Ser Glu Ser Thr Pro Pro Ser Ala Asn Arg 
850 855 860 

Thr Ser Pro Ala Asn Lys Phe Glu Gly He Gin Gin Gin Gin Ser Thr 
865 870 875 880 

Thr Ser Met Asn Thr Lys Ala Thr Phe Gly Pro Arg Val Leu Pro Lys 
885 890 895 



Leu Pro Gin Lys Val Ala Leu Arg Lys He Asp Thr He His Leu Pro 
900 905 910 

Ser Val Asp Lys Ser Gly Pro Asp Val Leu Gin Lys Pro Pro Gin Ala 
915 920 925 



Gin Asp Ala Pro Pro Thr Arg Ala Ser Asp Thr He Thr Arg Pro Thr 
930 935 940 
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Glu Pro Pro Pro Lys He Pro Gin Val Ala Glu Arg Ser Gin Pro Val 
945 950 955 960 

Asp Val Pro Gin Lys Pro His He Ser Asp Leu Pro Pro Lys Pro Gin 
965 970 975 

Leu Ser Asp Leu Pro Pro Lys Pro Gin Leu Ser Asp Leu Pro Pro Lys 
980 985 990 

Pro Gin Leu Ser Asp Leu Pro Pro Lys Pro Gin Leu Lys Asp Leu Pro 
995 1000 1005 

Pro Lys Pro Gin He Ser Asp Leu Pro Ser Lys Pro Ala Val Cys Ser 
1010 1015 1020 

Ala Ser Glu Ala Thr Gin Arg Gin Ser Thr Gin Glu Glu Thr Ser Pro 
1025 1030 1035 1040 

Lys Pro Gin Leu Thr Glu Thr Gin Ser Phe Ser Gin Gin Glu Glu Leu 
1045 1050 1055 

Ser Pro Arg Gin Ala Ser Glu Asp Thr Asn Gly Ala Pro Ala Gly Ala 
1060 1065 1070 

Leu Glu Met Pro Val Pro Met Pro Arg Lys He Asn Thr Val Ala Lys 
1075 1080 1085 

Asn Lys Ala Lys Arg Val Lys Thr He Tyr Asp Cys Gin Ala Asp Asn 
1090 1095 1100 

Asp Asp Glu Leu Thr Phe Val Glu Gly Glu Val He He Val Thr Gly 
1105 1110 1115 1120 

Glu Glu Asp Gin Glu Trp Trp He Gly His He Glu Gly Gin Pro Glu 
1125 1130 1135 

Arg Lys Gly Val Phe Pro Met Ser Phe Val His He Leu Ser Asp 
1140 1145 1150 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3456 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ATGAGGTCCT CGTCCTCGCG TTTGTCAAGT TTTTCCTCCA GGGATTCATT ATGGAGTCGG 



60 
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ATGCCGGATC AGATCTCCGT GTCCGAGTTT CTCTCGGAGA CGACGGAGGA TTACAATTCC 120 

CCCACGACCT CGAGCTTCAC CACCCGCCTG CAGAGCTGCC GGAACACGGT CAATGTTCTG 180 

GAAGAGGCTT TGGATCAGGA CCGAACTGCT TTACAGAAGG TCAAGAAATC TGTCAAAGCA 240 

ATCTACAACT CGGGTCAAGA ACATGTGCAG AATGAAGAGA ATTATGGACA GGCACTGGAC 300 

AAGTTTGGCA GCAACTTCAT CAGCCGAGAT AACTCTGATC TGGGAACAGC CTTCATCAAG 360 

TTTTCTGGAC TTATCAAAGA GCTGGCTGCT CTCCTCAAGA ACCTGCTCCA GAGCCTCAGC 420 

CACAACGTCA TCTTCACCCT GGACTCTCTG CTCAAAGGAG ATCTAAAGGG AGTGAAGGGG 480 

GACCTTAAAA AGCCTTTCGA CAAGGCCTGG AAAGACTATG AAACCAAGTT CACAAAGATC 540 

GAGAAGGAGA AGAGAGAACA TGCCAAGCAG CACGGCATGA TCCGCACAGA AATCACCGGC 600 

GCAGAGATTG CAGAAGAGAT GGAGAAGGAG CGGAGGATCT TTCAGCTGCA GATGTGTGAG 660 

TACCTGATCA AAGTCAATGA GATTAAGACC AAGAAGGGAG TGGATCTCCT CCAGAATCTC 720 

ATCAAGTATT ATCATGCACA GTGCAATTTC TTCCAGGATG GCTTGAAAAC TGCTGACAAG 780 

TTGAAGCAGT ATATTGAAAA ATTAGCAGCT GATCTTTATA ATATAAAACA GACTCAGGAT 840 

GAGGAGAAAA AACAGCTCAC AGCTCTCAGA GACCTCATCA AATCTTCCTT ACAGCTGGAC 900 

CAGAAGGAGG ATTCTCAGAG TAAGCAGAGC GGGTACAGCA TGCACCAGCT GCAGGGCAAT 960 

AAGGAGTTTG GCAGTGAGAA GAAGGGCTAT CTCTTCAAGA AGAGTGATGG GATCCGTAAG 1020 

GTGTGGCAGA GGAGGAAGTG CTCAGTGAAA AATGGCATCC TCACCATCTC TCATGCCACA 1080 

TCCAACAGGC AGCCGGTGAG ACTGAATCTG CTGACCTGCC AGGTTAAACC CAGTGGAGAG 1140 

GATAAGAAGT GCTTTGACCT CATCTCTCAT AATCGAACAT ATCATTTC GA GGCAGAGGAC 1200 

GAACAGGAGT TTGTGATATG GATCTCGGTG CTGACTAATA GTAAGGAGGA GGCTCTGAAC 1260 

ATGGCATTTC GTGGGGAGCA GAGTGCTGGA GATGACAGTT TGGAGGACTT GAC CAAAGCC 1320 

ATCATCGAGG ACGTGCTGCG CATTCCTGGA AACGAAGTCT GCTGTGACTG TGGGGTTCCA 1380 

GAGCCCAAAT GGTTATCCAC TAACCTCGGC ATCCTGACGT GCATCGAGTG TTCAGGAATC 1440 

CACAGGGAAA TGGGAGTCCA TATTTCGCGC ATCCAATCCA TGGAGCTTGA CAAACTTGGA 1500 

ACCTCTGAAC TCTTGCTGGC TAAGAACGTG GGCAACAGTA GTTTCAACGA AATATTAGAA 1560 

GGGAATCTGC CGAGTCCTTC ACCAAAGCCA GCGCCATCAA GTGACATGAC CGAGAGGAAG 1620 

GAGTACATCA ATGCGAAGTA CGTGGAGCAC AGGTTCGCTC GGCGAACGGC CACTACAGCC 1680 

ACAGCCAGAC AGGGCGACTT GTACGAGGCG GTGAGAACGC GAGACTTGAT GGCTCTCATT 1740 
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CAGCTCTATG CAGATGGAGT GGAGCTAATG GATCCTTTCC CAGAAGCAGG ACAGGACCCG 1800 

GGAGAGACAG CTCTGCACTT TGCTGTTCGG ACATCAGACC AGACTTCCCT GCACCTGGTG 1860 

GACTTTCTTG TCCAAAACAG TGGGACTCTA GACAGACAGA CGGAGAGTGG AAACGCTGCT 1920 

CTCCATTACT GCTGCACATA TGAGAAGCCA GAGTGTCTCA AACTGCTGCT CAGGGGAAAA 1980 

CCGTCTATTG ACCTGGTTAA TCAAAACGGG GAGACAGCAT TGGATATCGC CAGACGACTG 2040 

AGAAATGTAC AGTGTGAAGA GCTACTGGTG GAGGCAGCAG CCGGGAGGTT TAATCCTCAT 2100 

GTGCATGTGG AGTATGAGTG GAATCTGCGG CTGGAGGAGA TTGATGAGAG TGACGATGAC 2160 

CTGGATGACA AGCCTAGTCC AGTGAAGAAG GAGCGTTCTC CTCGTCCTCA GAGCTTCTGT 2220 

CATTCGTCCA GCGTGTCTCC TCAGGAGAAG TTAACCCTGC CGGGGTATCT AGGACACAGG 2280 

GACAAGCAGA GACTGTCCTA TGGAGCCTTT GCCAACCCCG TCTACAG CAC CTCCACCGAA 2340 

ACCCCTGCAT CTCCAGTGTC AGAGGGACCC ACCATAGCCA GCAAGACCCC TGCAAAAGCT 2400 

CCGTCCTGTG GGCCGCCCAC CTCTCTGCCG CTGGGATCTC AATCGAGTGC AGGAGGCAGC 2460 

TCCACTTTGT CTAAGAAGAG AGCTCCTCCT CCACCTCCCG GACACAAGCG CACCCACTCA 2520 

GATCCCCCCA GTCCCGTACT GCAGGGTCCG CAGAGCAAAG GAAGTGAGTC CACACCTCCT 2580 

TCTGCAAATC GGACATCCCC GGCCAACAAG TTTGAGGGAA TCCAGCAGCA GCAAAGCACT 2640 

ACGTCTATGA ACACAAAAGC AACATTTGGC CCACGAGTTC TTCCCAAACT ACCTCAAAAA 2700 

GTGGCACTAC GAAAGATTGA CACAATCCAC CTCCCATCAG TGGACAAGTC TGGTCCTGAT 2760 

GTGCTTCAGA AACCCCCACA GGCCCAGGAT GCACCTCCCA CCAGAGCCTC AGATACAATA 2820 

ACCAGACCCA CTGAACCTCC ACCTAAAATT CCACAGGTCG CAGAACGATC CCAGCCTGTG 2880 

GATGTCCCGC AGAAACCGCA CATCTCAGAC CTTCCTCCCA AACCGCAACT ATCAGATCTT 2940 

CCCCCCAAAC CCCAATTGTC GGATTTACCA CCAAAACCTC AGCTTTCTGA CCTGCCCCCG 3000 

AAGCCTCAGC TTAAGGATCT TCCCCCTAAG CCGCAGATCA GTGATCTGCC ATCCAAACCG 3060 

GCCGTGTGTT CTGCGTCTGA GGCCACACAG AGGCAGTCAA CGCAGGAGGA AACCAGTCCG 3120 

AAGCCCCAGC TGACGGAGAC ACAGTCATTC AGCCAGCAGG AGGAGCTCTC ACCCCGACAG 3180 

GCCAGCGAGG ACACCAATGG AGCGCCCGCA GGAGCCTTGG AAATGCCAGT CCCAATGCCA 3240 

CGCAAAATTA ACACAGTAGC AAAGAACAAA GCGAAGCGTG TGAAAACCAT CTATGATTGC 3300 

CAGGCAGACA ATGACGATGA GCTGACTTTT GTGGAGGGCG AGGTTATAAT TGTCACAGGA 3360 
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GAGGAAGACC AGGAGTGGTG GATCGGGCAC ATAGAGGGTC AGCCTGAAAG GAAAGGGGTC 3420 
TTCCCAATGT CCTTCGTGCA CATTCTGTCA GACTGA 3456 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5954 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



( ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 433.. 3378 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GGAGCTCGCG CGCCTGCAGG TCGACACTAG TGGATCCAAA GAATTCGGCA CGAGGCAAAA 60 

TCCAGCACGA CAACCTACAC TCCTGTCCCA AAACAGAAGA GAAGCACATC ACCGCACTGC 120 

TTTATTATCA AACGAGTGGA CTAAATTCCT ACTTAAACTG GAAGAAGTGA GATCCGTGAA 180 

AGAAAGAGAG GGAAAAAGAG AGAGATTTCC CCGTCGTACA AGCCGCACTT CAGTGTAGTT 240 

GGCTAATGAT TTGTATTAAT TCCCAACTTG TTTTAATCCA CCGAGGACAA AACACCGCGA 300 

TGATAAGACT CCAGGACGCT CATGAGAGTT TTAATTCGGC GTTTCATCTC TGAATTTCGA 360 

CATTAAGTGC ACCGCGACCG GCCAAATCAA GGATTAAACA CGACATTTGT GGATTTCGCC 420 

AAAGGAGATA CA ATG CCT GAC CAG ATA ACA GTG GCG GAG TTT GTC ACG 468 
Met Pro Asp Gin lie Thr Val Ala Glu Phe Val Thr 
15 10 

GAG ACA AAT GAA GAT TAT AAA TCG CCC ACC GCC TCA AAC TTC ACC ACC 516 
Glu Thr Asn Glu Asp Tyr Lys Ser Pro Thr Ala Ser Asn Phe Thr Thr 
15 20 25 

AGA ATG ACT CAC TGC AGG AAC ACA GTA TCC GCA CTG GAG GAG GCC CTG 564 
Arg Met Thr His Cys Arg Asn Thr Val Ser Ala Leu Glu Glu Ala Leu 
30 35 40 

GAT GTG GAC CGC AGT GTC CTT TAC AAG ATG AAG AAG TCA GTT AAG GCT 612 
Asp Val Asp Arg Ser Val Leu Tyr Lys Met Lys Lys Ser Val Lys Ala 
45 50 55 60 

ATT TAC GCC TCG GGT CTG GCT CAT GTG GAG AAT GAG GAG CAG TAC ACT 660 
lie Tyr Ala Ser Gly Leu Ala His Val Glu Asn Glu Glu Gin Tyr Thr 
65 70 75 
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CAA GCT CTG GAG AAG TTC GGA GAG AAC TGT GTG TAC AGA GAT GAG CCG 708 
Gin Ala Leu Glu Lys Phe Gly Glu Asn Cys Val Tyr Arg Asp Asp Pro 
80 85 90 

GAC CTG GGA TCA GCC TTC CTG AAG TTC TCC GTC TTC ACC AAG GAG CTC 756 
Asp Leu Gly Ser Ala Phe Leu Lys Phe Ser Val Phe Thr Lys Glu Leu 
95 100 105 

ACG GGA CTC TTC AAG AAC CTG TTT CAG AAC ATG AAT AAT ATC ATT ACC 804 
Thr Ala Leu Phe Lys Asn Leu Phe Gin Asn Met Asn Asn lie lie Thr 
110 115 120 

TTC CCA TTG GAC AGT CTG CTG AAG GGA GAT CTG AAA GGG GTT AAA GGG 852 
Phe Pro Leu Asp Ser Leu Leu Lys Gly Asp Leu Lys Gly Val Lys Gly 
125 130 135 140 

GAT CTC AAG AAG CCC TTC GAT AAA GCC TGG AAA GAC TAC GAG ACT AAA 900 
Asp Leu Lys Lys Pro Phe Asp Lys Ala Trp Lys Asp Tyr Glu Thr Lys 
145 150 155 

GTC TCT AAA ATA GAG AAG GAG AAA AAA GAG CAC GCC CGG CAG CAC GGA 948 
Val Ser Lys lie Glu Lys Glu Lys Lys Glu His Ala Arg Gin His Gly 
160 ' 165 170 

ATG ATC CGG ACG GAG ATC AGC GGA GCA GAG ATA GCA GAA GAG ATG GAA 996 
Met lie Arg Thr Glu lie Ser Gly Ala Glu lie Ala Glu Glu Met Glu 
175 180 185 

AAA GAG CGG CGT TTC TTC CAG CTT CAG ATG TGT GAG TAC CTC CTC AAA 1044 
Lys Glu Arg Arg Phe Phe Gin Leu Gin Met Cys Glu Tyr Leu Leu Lys 
190 195 200 

GTC AAT GAA ATC AAG ATC AAA AAA GGT GTC GAC CTG CTC CAG AAT CTC 1092 
Val Asn Glu lie Lys He Lys Lys Gly Val Asp Leu Leu Gin Asn Leu 
205 210 215 220 

ATC AAA TAC TTC CAC GCA CAG TGC AAC TTC TTT CAG GAT GGT CTC AAA 1140 
He Lys Tyr Phe His Ala Gin Cys Asn Phe Phe Gin Asp Gly Leu Lys 
225 230 235 

GCG GTG GAC AAC CTC AAA CCC TCA ATA GAA AAA CTG GCC ACA GAC TTG 1188 
Ala Val Asp Asn Leu Lys Pro Ser He Glu Lys Leu Ala Thr Asp Leu 
240 245 250 

CAC TCG ATC AAA CAG GTA CAG GAT GAA GAA CGC AGA CAG CTA ACC CAG 1236 
His Ser He Lys Gin Val Gin Asp Glu Glu Arg Arg Gin Leu Thr Gin 
255 260 265 



TTA CGG GAT GTG CTA AAA ACT GCT CTG CAA GTG GAG CAG AAG GAG GAC 
Leu Arg Asp Val Leu Lys Thr Ala Leu Gin Val Glu Gin Lys Glu Asp 
270 275 280 
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TCT CAG GTT AGA CAG AGC GCC ACC TAC AGT CTG CAC CAG CCG CAG GGC 1332 
Ser Gin Val Arg Gin Ser Ala Thr Tyr Ser Leu His Gin Pro Gin Gly 
285 290 295 300 



AAC AAA GAG CAT GGG ACT GAG CGC AGC GGC AAC CTT TAC AAG AAG AGT 1380 
Asn Lys Glu His Gly Thr Glu Arg Ser Gly Asn Leu Tyr Lys Lys Ser 
305 310 315 

GAC GGG CTG CGG AAA GTG TGG CAG AAG AGA AAG TGC ACA GTA AAG AAT 1428 
Asp Gly Leu Arg Lys Val Trp Gin Lys Arg Lys Cys Thr Val Lys Asn 
320 325 330 

GGA TAT TTG ACC ATC TCA CAT GGG ACG GCA AAC AGA CCT CCC GCC AAA 1476 
Gly Tyr Leu Thr He Ser His Gly Thr Ala Asn Arg Pro Pro Ala Lys 
335 340 345 

CTC AAT CTT CTC ACC TGT CAG GTG AAG CAC AAC CCA GAG GAG AAG AAA 1524 
Leu Asn Leu Leu Thr Cys Gin Val Lys His Asn Pro Glu Glu Lys Lys 
350 355 360 

AGT TTT GAC CTC ATC TCA CAT GAC AGA ACA TAT CAT TTC CAG GCA GAA 1572 
Ser Phe Asp Leu He Ser His Asp Arg Thr Tyr His Phe Gin Ala Glu 
365 370 375 380 

GAT GAG CCA GAG TGT CAA ATA TGG ATC TCA GTG CTG CAG AAC AGT AAA 1620 
Asp Glu Pro Glu Cys Gin He Trp He Ser Val Leu Gin Asn Ser Lys 
385 390 395 

GAA GAG GCG CTC AAC AAC GCC TTC AAG GGC GAC CAG CAT GTT GGT GAA 1668 
Glu Glu Ala Leu Asn Asn Ala Phe Lys Gly Asp Gin His Val Gly Glu 
400 405 410 

AAT AAC ATT GTG CAG GAG CTC ACC AAG GCC ATC CTG GGA GAG GTG AAG 1716 
Asn Asn He Val Gin Glu Leu Thr Lys Ala He Leu Gly Glu Val Lys 
415 420 425 

CGG ATG GCG GGG AAC GAT GTC TGC TGC GAC TGC GGT GCT CCC GGC CCC 1764 
Arg Met Ala Gly Asn Asp Val Cys Cys Asp Cys Gly Ala Pro Gly Pro 
430 435 440 

ACA TGG CTC TCC ACC AAC CTG GGC ATC CTG ACC TGC ATC GAG TGT TCG 1812 
Thr Trp Leu Ser Thr Asn Leu Gly He Leu Thr Cys He Glu Cys Ser 
445 450 455 460 

GGG ATC CAC AGA GAG CTG GGC GTC CAT TAC TCC CGA ATC CAG TCC CTC 1860 
Gly He His Arg Glu Leu Gly Val His Tyr Ser Arg He Gin Ser Leu 
465 470 475 



ACA CTC GAC GTC CTC AGC ACC TCC GAG CTC TTG CTG GCC AAG AAC GTG 
Thr Leu Asp Val Leu Ser Thr Ser Glu Leu Leu Leu Ala Lys Asn Val 
480 485 490 
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GGG AAT GCT GGC TTC AAT GAG ATC ATG GAG GCC TGT CTG ACG GCA GAA 1956 

Gly Asn Ala Gly Phe Asn Glu lie Met Glu Ala Cys Leu Thr Ala Glu 
495 500 505 

GAT GTG ATC AAA CCG AAT CCA GCC AGT GAC ATG CAG GCG AGG AAG GAC 2004 

Asp Val lie Lys Pro Asn Pro Ala Ser Asp Met Gin Ala Arg Lys Asp 

510 515 520 



TTT ATC ATG GCC AAA TAC ACA GAG AAA CGC TTC GCT CGT AAG AAG TGT 2052 
Phe lie Met Ala Lys Tyr Thr Glu Lys Arg Phe Ala Arg Lys Lys Cys 
525 530 535 540 

CCA GAC GCA CTG TCG AAG CTG CAC ACG CTC TGT GAT GCT GTG AAG GCC 2100 
Pro Asp Ala Leu Ser Lys Leu His Thr Leu Cys Asp Ala Val Lys Ala 
545 550 555 

CGG GAC ATT TTC TCT CTC ATC CAG GTC TAT GCT GAA GGA GTG GAT CTG 2148 
Arg Asp He Phe Ser Leu He Gin Val Tyr Ala Glu Gly Val Asp Leu 
560 565 570 

ATG GAG CCC ATT CCT CTG GCT AAT GGA CAT GAA GAA GGT GAG ACG GCT 2196 
Met Glu Pro lie Pro Leu Ala Asn Gly His Glu Gin Gly Glu Thr Ala 
575 580 585 

CTT CAT CTG GCC GTG AGA CTG GTG GAC AGA ACT TCC CTA CAC ATC ATC 2244 
Leu His Leu Ala Val Arg Leu Val Asp Arg Thr Ser Leu His He He 
590 595 600 



GAC TTC CTC ACC CAA AAC AGT TTA AAC CTG GAT AAG GAA ACG GCT AAA 2292 
Asp Phe Leii Thr Gin Asn Ser Leu Asn Leu Asp Lys Gin Thr Ala Lys 
605 610 615 620 

GGA AGC ACA GCT CTG CAT TAC TGC TGC CTG ACG GAC AAC AGC GAG TGT 2340 
Gly Ser Thr Ala Leu His Tyr Cys Cys Leu Thr Asp Asn Ser Glu Cys 
625 630 635 

CTC AAA CTG CTG CTC AGA GGA AAA GCC TCC ATA GAT ATC GCT AAT GAA 2388 
Leu Lys Leu Leu Leu Arg Gly Lys Ala Ser He Asp He Ala Asn Glu 
640 645 650 

GCT GGA GAG ACC CCG TTG GAC ATC GCC AGG CGA CTC AAA CAT CTG CAG 2436 
Ala Gly Glu Thr Pro Leu Asp He Ala Arg Arg Leu Lys His Leu Gin 
655 660 665 



TGT GAG GAA CTG CTG AAC CAG GCT CTT GCA GGG AAG TTC AAT GCT CAT 2484 
Cys Glu Glu Leu Leu Asn Gin Ala Leu Ala Gly Lys Phe Asn Ala His 
670 675 680 



GTG CAT GTG GAG TAT GAG TGG AGA CTT CAG CAT GAA GAC CTG GAC GAG 2532 
Val His Val Glu Tyr Glu Trp Arg Leu Gin His Glu Asp Leu Asp Glu 
685 690 695 700 



AGT GAT GAA GAT CTG GAT GAG AAG TCG AGT CCT CAC CGG CGG GAT GAG 
Ser Asp Glu Asp Leu Asp Glu Lys Ser Ser Pro His Arg Arg Asp Glu 
705 710 715 
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CGG CCC ATC AGC TGC TAG ACA CCG GGC AGT AAC TCC CTT CAG CTG AGT 2628 
Arg Pro lie Ser Cys Tyr Thr Pro Gly Ser Asn Ser Leu Gin Leu Ser 
720 725 730 

CCA GCC AGC CTG AGC CGA GAC GGT CGA GAC CTG GTT AAA GAC AAG CAA 2676 
Pro Ala Ser Leu Ser Arg Asp Gly Arg Asp Leu Val Lys Asp Lys Gin 
735 740 745 

CGC TTT GTG CCA AAC CTG GTC AAC AAT GAA ACC TAC GGG ACC ATC ATT 2724 
Arg Phe Val Pro Asn Leu Val Asn Asn Glu Thr Tyr Gly Thr lie lie 
750 755 760 

AAC ACC AGC TCA CCC GTC AGC CTG TCC TCT TCT GCT CCA CCT CTA CCA 2772 
Asn Thr Ser Ser Pro Val Ser Leu Ser Ser Ser Ala Pro Pro Leu Pro 
765 770 775 780 

CCC CGA AAC CTA GTT CAG CCG TCT GCT CTT GCA GGA CTG ACT CAA GGA 2820 
Pro Arg Asn Leu Val Gin Pro Ser Ala Leu Ala Gly Leu Thr Gin Gly 
785 790 795 

TCT CCC GGC TGG AAG CCT GGC TCT CTG GAT CTG AGC GGC AGA CAG AGA 2868 
Ser Pro Gly Trp Lys Pro Gly Ser Leu Asp Leu Ser Gly Arg Gin Arg 
800 805 810 

TCC TCC TCT GAC CCT CCC AAC ATG CAT CCT CCT GCG CCT CCC TTA CGG 2916 
Ser Ser Ser Asp Pro Pro Asn Met His Pro Pro Ala Pro Pro Leu Arg 
815 820 825 

GTC ACT TCC ACC TCC CTT CTA ATG CCC AGC GGT GCT GCT CCT CCT CTG 2964 
Val Thr Ser Thr Ser Leu Leu Met Pro Ser Gly Ala Ala Pro Pro Leu 
830 835 840 

GCT AAA GCT ACT GGT ATG ATG GAG ACC ATG AAT ATG CAA CCC AAA CCC 3012 
Ala Lys Ala Thr Gly Met Met Glu Thr Met Asn Met Gin Pro Lys Pro 
845 850 855 860 

GGA CAG GGG CCT CCT GGA CAG AAC ATC AAC CGG GCT ACA AGT GCG GAC 3060 
Gly Gin Gly Pro Pro Gly Gin Asn He Asn Arg Ala Thr Ser Ala Asp 
865 870 875 

AAA AAC TTC AGC AAA AGC ACA CTG ATG CGC TCC GGA TCC ATC GAG AGA 3108 
Lys Asn Phe Ser Lys Ser Thr Leu Met Arg Ser Gly Ser He Glu Arg 
880 885 890 

CCA GCT AAA GAA GTC CCA GGA GGC CCA CAA AAC ACC ACT GGT CAA ACT 3156 
Pro Ala Lys Glu Val Pro Gly Gly Pro Gin Asn Thr Thr Gly Gin Thr 
895 900 905 

CTG CCT GCG ACC CAC ATG CCC AGG AAA ACG TAT TTG AAG CCG AAG CGT 3204 
Leu Pro Ala Thr His Met Pro Arg Lys Thr Tyr Leu Lys Pro Lys Arg 
910 915 920 
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GTG AAG GCC ATG TAT AAC TGT GTG GCC GAT AAT CCA GAC GAG CTG ACC 3252 
Val Lys Ala Met Tyr Asn Cys Val Ala Asp Asn Pro Asp Glu Leu Thr 
925 930 935 940 

TTC TCT GAG GGA GAG CTT ATC GTG GTG GAT GGA GAG GAG GAC CAG GAG 3300 
Phe Ser Glu Gly Glu Leu lie Val Val Asp Gly Glu Glu Asp Gin Glu 
945 950 955 

TGG TGG CTG GGC CAC ATT GAG GGA GAG CCA ATG AGA AGA GGA GCG TTT 3348 
Trp Trp Leu Gly His lie Glu Gly Glu Pro Met Arg Arg Gly Ala Phe 
960 965 970 

CCT GTC ACG TTT GTA CAG TTC ATT ATG GAC TGAAGCTCGA GAGATCACAC 3398 
Pro Val Thr Phe Val Gin Phe He Met Asp 
975 980 

ACTGAACTGA TGACGGCACT TCTCTGCCTC TGTGTGGCCT CACTAACCAC CACTATCTTC 3458 

ATCATCATCG TTGTTCTTCC CTTTATGGTG AGGCCTGTAT CTTCACCAAT CTTCCACAAG 3518 

TCCTGCCTCT GGAGAAATCA GCCTTCTGGG CAATAAACGC ACTTTTGAAC TTAATTTATC 3578 

ATGAACACAA TGCTAATGAA TGTCACCAAG ATGAAGGTTT TGTTTCAGGA TCATTCACAT 3638 

CCTTATTTCT TTAGACAGAT CTGTGAATAT AGTCTTATAT GCCCACATTC CACATCTGGC 3698 

AAGGAAAGAC GGAAGCATAG TAGTGAAATG ACAGCCTTTT TGGAGGACTC TGTTGGATAA 3758 

GACGGCTCTG TTAATGGTGC TAAAGCAGGA ATATGCTACA GGAGCTGTCT GTCCTAGGAG 3818 

GAGCGCACTG ATGTCCCCGT TTTCACACTA CCTGCCCCAG TGCTGAGTGC AGAAATAGGT 3878 

TTTCTCCAGC ACTCGCACAT GGGAAATCTC TGAAGTGCAC TGTGTGATGG AGAAACTGAC 3938 

AGACTGAAGA GTGCTTTTGC GCTGGCTGAG GGACGTGAAG ATTAAATGAA AGTAATCTTG 3998 

ACCCTGAAGC TGCTGGGATT TTGGAGCGTT GTGAATGTTC TCTGGCCTCC AGGGAAAGGA 4058 

GAGGAAGAGC ATCCAGGAGC TTTTTTTCTG TATAGGTATT TATAAATCGG AGCTGTTCTG 4118 

TTTTAGACTC TCGTTGATTT TAACGATCTT CCGCAGAACT TGCTTCATTG TGCGAGCAAT 4178 

CTGCTGAATG ATGTCATTTC TTTTTAAAGA GACAGACCAA ACCTTCAAAT AATTAATTTA 4238 

CTCCAGGAGT GTCAAAGTTC CTGGAGGGCC ACAGCCCTGC ACAGTTTAGT TCCAACCCTG 4298 

CTCCAACACA CTTACCTGCA AGTTTCAAAC AAGCCTGAAG AACTTAATTA GTTTGATCAG 4358 

GTGTTTAATC AGGGTTGTGC AGAGCTGCGG CCCTCCAGGA ACTCAGTTTG ACACCTGTGA 4418 

TTTACTCAAT TTACAAAATG TCCAGAGTGC TCTATATCAG CATTTCCCAA CCCTCTTCTT 4478 

GAAGGCACAC CAACAGTACA CATTTTCAAC CTCTTCCTAA GCAAACACGC CTCAATCAAC 4538 

TCAACAGACC ATTAGAAGAG ACTCTAAAAC CTGAAGTAAA TGAGTCAGAT AAGGGAGACT 4598 
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CCCAAAATAT GAACTGTTGG TGTGCCTCCA GGAACACTGT TTGGAAACCT TCTCTATATG 4658 

CTCAATTTGA TGTAATCCAA GTTGTCTGAA GACATACAGT AAACTTAAAT GAGTAAATAG 4718 

ATGGGTTTTA GAGGAAAACT AAACATTTAT TCTCAAGTCT TTACAAACCT TACTTCAGTG 4778 

TTTATTTGGA GCAATGTGGG TACTAAATGT AGGAATCTGT TCATATGGAA ATATATATAT 4838 

ATATATATAT ATATATATAT ATATATATAT ATTCAAAAAA GGTAATAGTG ACTTTAATCG 4898 

TACCAGTTCT GCTTATTTTA TATATGAAAG ATTTGCAACA GAAAAGTGCA AAATTGAGGT 4958 

GGCACAAATG GATTTCAATA CACTGATCCA ATTCTCTAAA TATTGTCTTA TACAATGAAA 5018 

TCCTACAGGA TTGTAATAGC AAATTAAGTT ATTTTCTGAA AATCATTCAC TGTCATTGTC 5078 

AAACAAGGTC AAATCATCAA CTTCACATTT GAATATGGAT TCAGCTTTGG TTTGAGTATT 5138 

CTGGTTACAG GGTGAACATG TTTCATCAAT CATACTGATT AAAGCACTCT TGCCATTTTT 5198 

CACTAATCAT CCTCTGGTTC AATGGAAGAA AAAAGTCATA CTTTTGGCAT GACGGTGAGC 5258 

AAATGACAGC ATTTACATTT GTGGAGGGGG AGTGACTGTC TTTTAAGATG CTTTTGCACA 5318 

GTTTTAAATA GAGTCTGTTT TAATTTAAAC CTTTGGATAA AAGCGTCTGC TAAATTAATA 5378 

AATTTAAACA GATTACGAAG TGTGAATGAC AG CT ATTTTC TACTAGACCG TTTTGGTGTA 5438 

ACCCTGACGG TTGTTCCCTG TAGCAGTAAT AACTCTCTTT CTCTCTCTAG CGCTCTAATT 5498 

GTATTCCAGA GAAAATGAAA ATCTCTCTCA TCACTTCTCC TAATCCTTTG TAAAGCTCAT 5558 

CCATCAGTGA GTGTGTGCAG GAGTAACACA GCAGAGCGTT TTCTGTCAAG AGTGTTTGAT 5618 

GTGGTTGCAG AGCAACTTAG CGTCTGTTAT GTAACTTTTA ATTACAGTCA TGTTAGTCTT 5678 

GATTGAGCTC AGGCCAGTGT GTATACGGCC TGCAGTGATT GTAAATAACT GTAGACTTTT 5738 

TGCTTTGTGC ATATTTAATT GTAAACAGAG AGCTAAACTG ATACTGACTG ATGTGTTGAC 5798 

GTATTGTTAG ATAAGACTGT TACAGTACAC TTTTAACTAC TCACCCCTTT ACCATAAACA 5858 

TTGTTGACGC TAATATATAA TTCATATATG TACAAATAAA GAGTACTTCT AGAGCGGCCG 5918 

CGGGCCCATC GATTTTCCAC CCGGGTGGGT ACCAGG 5954 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 982 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Pro Asp Gin lie Thr Val Ala Glu Phe Val Thr Glu Thr Asn Glu 
15 10 15 

Asp Tyr Lys Ser Pro Thr Ala Ser Asn Phe Thr Thr Arg Met Thr His 
20 25 30 

Cys Arg Asn Thr Val Ser Ala Leu Glu Glu Ala Leu Asp Val Asp Arg 
35 40 45 

Ser Val Leu Tyr Lys Met Lys Lys Ser Val Lys Ala lie Tyr Ala Ser 
50 55 * 60 

Gly Leu Ala His Val Glu Asn Glu Glu Gin Tyr Thr Gin Ala Leu Glu 
65 70 75 80 

Lys Phe Gly Glu Asn Cys Val Tyr Arg Asp Asp Pro Asp Leu Gly Ser 
85 90 95 

Ala Phe Leu Lys Phe Ser Val Phe Thr Lys Glu Leu Thr Ala Leu Phe 
100 105 110 

Lys Asn Leu Phe Gin Asn Met Asn Asn lie lie Thr Phe Pro Leu Asp 
115 120 125 

Ser Leu Leu Lys Gly Asp Leu Lys Gly Val Lys Gly Asp Leu Lys Lys 
130 135 140 

Pro Phe Asp Lys Ala Trp Lys Asp Tyr Glu Thr Lys Val Ser Lys lie 
145 150 155 160 

Glu Lys Glu Lys Lys Glu His Ala Arg Gin His Gly Met lie Arg Thr 
165 170 175 

Glu lie Ser Gly Ala Glu lie Ala Glu Glu Met Glu Lys Glu Arg Arg 
180 185 190 

Phe Phe Gin Leu Gin Met Cys Glu Tyr Leu Leu Lys Val Asn Glu lie 
195 200 205 

Lys lie Lys Lys Gly Val Asp Leu Leu Gin Asn Leu lie Lys Tyr Phe 
210 215 220 

His Ala Gin Cys Asn Phe Phe Gin Asp Gly Leu Lys Ala Val Asp Asn 
225 230 235 240 

Leu Lys Pro Ser lie Glu Lys Leu Ala Thr Asp Leu His. Ser lie Lys 
245 250 255 

Gin Val Gin Asp Glu Glu Arg Arg Gin Leu Thr Gin Leu Arg Asp Val 
260 265 270 
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Leu Lys Thr Ala Leu Gin Val Glu Gin Lys Glu Asp Ser Gin Val Arg 
275 280 285 

Gin Ser Ala Thr Tyr Ser Leu His Gin Pro Gin Gly Asn Lys Glu His 
5 290 295 300 

Gly Thr Glu Arg Ser Gly Asn Leu Tyr Lys Lys Ser Asp Gly Leu Arg 
305 310 315 320 

10 Lys Val Trp Gin Lys Arg Lys Cys Thr Val Lys Asn Gly Tyr Leu Thr 

325 330 335 

He Ser His Gly Thr Ala Asn Arg Pro Pro Ala Lys Leu Asn Leu Leu 
340 345 350 

15 

Thr Cys Gin Val Lys His Asn Pro Glu Glu Lys Lys Ser Phe Asp Leu 
355 360 365 

He Ser His Asp Arg Thr Tyr His Phe Gin Ala Glu Asp Glu Pro Glu 
20 370 375 380 

Cys Gin He Trp He Ser Val Leu Gin Asn Ser Lys Glu Glu Ala Leu 
385 390 395 400 

25 Asn Asn Ala Phe Lys Gly Asp Gin His Val Gly Glu Asn Asn He Val 

405 410 415 

Gin Glu Leu Thr Lys Ala He Leu Gly Glu Val Lys Arg Met Ala Gly 
420 425 430 

30 

Asn Asp Val Cys Cys Asp Cys Gly Ala Pro Gly Pro Thr Trp Leu Ser 
435 440 445 

Thr Asn Leu Gly He Leu Thr Cys He Glu Cys Ser Gly He His Arg 
35 450 455 460 

Glu Leu Gly Val His Tyr Ser Arg He Gin Ser Leu Thr Leu Asp Val 
465 470 475 480 

40 Leu Ser Thr Ser Glu Leu Leu Leu Ala Lys Asn Val Gly Asn Ala Gly 

485 490 495 

Phe Asn Glu He Met Glu Ala Cys Leu Thr Ala Glu Asp Val He Lys 
500 505 510 

45 

Pro Asn Pro Ala Ser Asp Met Gin Ala Arg Lys Asp Phe He Met Ala 
515 520 525 

Lys Tyr Thr Glu Lys Arg Phe Ala Arg Lys Lys Cys Pro Asp Ala Leu 
50 530 535 540 

Ser Lys Leu His Thr Leu Cys Asp Ala Val Lys Ala Arg Asp He Phe 
545 550 555 560 



WO 98/36065 



PCT/US98/02724 



-139- 

Ser Leu lie Gin Val Tyr Ala Glu Gly Val Asp Leu Met Glu Pro He 
565 570 575 

Pro Leu Ala Asn Gly His Glu Gin Gly Glu Thr Ala Leu His Leu Ala 
580 585 590 

Val Arg Leu Val Asp Arg Thr Ser Leu His He He Asp Phe Leu Thr 
595 600 605 

Gin Asn Ser Leu Asn Leu Asp Lys Gin Thr Ala Lys Gly Ser Thr Ala 
610 615 620 

Leu His Tyr Cys Cys Leu Thr Asp Asn Ser Glu Cys Leu Lys Leu Leu 
625 630 635 640 

Leu Arg Gly Lys Ala Ser He Asp He Ala Asn Glu Ala Gly Glu Thr 
645 650 655 

Pro Leu Asp He Ala Arg Arg Leu Lys His Leu Gin Cys Glu Glu Leu 
660 665 670 

Leu Asn Gin Ala Leu Ala Gly Lys Phe Asn Ala His Val His Val Glu 
675 680 685 

Tyr Glu Trp Arg Leu Gin His Glu Asp Leu Asp Glu Ser Asp Glu Asp 
690 695 700 

Leu Asp Glu Lys Ser Ser Pro His Arg Arg Asp Glu Arg Pro He Ser 
705 710 715 720 

Cys Tyr Thr Pro Gly Ser Asn Ser Leu Gin Leu Ser Pro Ala Ser Leu 
725 730 735 

Ser Arg Asp Gly Arg Asp Leu Val Lys Asp Lys Gin Arg Phe Val Pro 
740 745 750 

Asn Leu Val Asn Asn Glu Thr Tyr Gly Thr He He Asn Thr Ser Ser 
755 760 765 

Pro Val Ser Leu Ser Ser Ser Ala Pro Pro Leu Pro Pro Arg Asn Leu 
770 775 780 

Val Gin Pro Ser Ala Leu Ala Gly Leu Thr Gin Gly Ser Pro Gly Trp 
785 790 795 800 

Lys Pro Gly Ser Leu Asp Leu Ser Gly Arg Gin Arg Ser Ser Ser Asp 
805 ' 810 815 

Pro Pro Asn Met His Pro Pro Ala Pro Pro Leu Arg Val Thr Ser Thr 
820 825 830 

Ser Leu Leu Met Pro Ser Gly Ala Ala Pro Pro Leu Ala Lys Ala Thr 
835 840 845 



Gly Met Met Glu Thr Met Asn Met Gin Pro Lys Pro Gly Gin Gly Pro 
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850 855 860 

Pro Gly Gin Asn He Asn Arg Ala Thr Ser Ala Asp Lys Asn Phe Ser 
865 870 875 880 

Lys Ser Thr Leu Met Arg Ser Gly Ser He Glu Arg Pro Ala Lys Glu 
885 890 895 

Val Pro Gly Gly Pro Gin Asn Thr Thr Gly Gin Thr Leu Pro Ala Thr 
900 905 910 

His Met Pro Arg Lys Thr Tyr Leu Lys Pro Lys Arg Val Lys Ala Met 
915 920 925 

Tyr Asn Cys Val Ala Asp Asn Pro Asp Glu Leu Thr Phe Ser Glu Gly 
930 935 940 

Glu Leu He Val Val Asp Gly Glu Glu Asp Gin Glu Trp Trp Leu Gly 
945 950 955 960 

His He Glu Gly Glu Pro Met Arg Arg Gly Ala Phe Pro Val Thr Phe 
965 970 975 

Val Gin Phe He Met Asp 
980 

(2) INFORMATION FOR SEQ 10 NO: 8: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2949 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

ATGCCTGACC AGATAACAGT GGCGGAGTTT GTCACGGAGA CAAATGAAGA TTATAAATCG 60 

CCCACCGCCT CAAACTTCAC CACCAGAATG ACTCACTGCA GGAACACAGT ATCCGCACTG 120 

GAGGAGGCCC TGGATGTGGA CCGCAGTGTC CTTTACAAGA TGAAGAAGTC AGTTAAGGCT 180 

ATTTACGCCT CGGGTCTGGC TCATGTGGAG AATGAGGAGC AGTACACTCA AGCTCTGGAG 240 

AAGTTCGGAG AGAACTGTGT GTACAGAGAT GACCCGGACC TGGGATCAGC CTTCCTGAAG 300 

TTCTCCGTCT TCACCAAGGA GCTCACGGCA CTCTTCAAGA ACCTGTTTCA GAACATGAAT 360 

AATATCATTA CCTTCCCATT GGACAGTCTG CTGAAGGGAG ATCTGAAAGG GGTTAAAGGG 420 

GATCTCAAGA AGCCCTTCGA TAAAGCCTGG AAAGACTACG AGACTAAAGT CTCTAAAATA 480 
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GAGAAGGAGA AAAAAGAGCA CGCCCGGCAG CACGGAATGA TCCGGACGGA GATCAGCGGA 540 
GCAGAGATAG CAGAAGAGAT GGAAAAAGAG CGGCGTTTCT TCCAGCTTCA GATGTGTGAG 600 
TACCTCCTCA AAGTCAATGA AATCAAGATC AAAAAAGGTG TCGACCTGCT CCAGAATCTC 660 
ATCAAATACT TCCACGCACA GTGCAACTTC TTTCAGGATG GTCTCAAAGC GGTGGACAAC 720 
CTCAAACCCT CAATAGAAAA ACTGGCCACA GACTTGCACT CGATCAAACA GGTACAGGAT 780 
GAAGAACGCA GACAGCTAAC CCAGTTACGG GATGTGCTAA AAACTGCTCT GCAAGTGGAG 840 
CAGAAGGAGG ACTCTCAGGT TAGACAGAGC GCCACCTACA GTCTGCACCA GCCGCAGGGC 900 
AACAAAGAGC ATGGGACTGA GCGCAGCGGC AACCTTTACA AGAAGAGTGA CGGGCTGCGG 960 

AAAGTGTGGC AGAAGAGAAA GTGCACAGTA AAGAATGGAT ATTTGACCAT CTCACATGGG 1020 

ACGGCAAACA GACCTCCCGC CAAACTCAAT CTTCTCACCT GTCAGGTGAA GCACAACCCA 1080 

GAGGAGAAGA AAAGTTTTGA CCTCATCTCA CATGACAGAA CATATCATTT CCAGGCAGAA 1140 

GATGAGCCAG AGTGTCAAAT ATGGATCTCA GTGCTGCAGA ACAGTAAAGA AGAGGCGCTC 1200 

AACAACGCCT TCAAGGGCGA CCAGCATGTT GGTGAAAATA ACATTGTGCA GGAGCTCACC 1260 

AAGGCCATCC TGGGAGAGGT GAAGCGGATG GCGGGGAACG ATGTCTGCTG CGACTGCGGT 1320 

GCTCCCGGCC CCACATGGCT CTCCACCAAC CTGGGCATCC TGACCTGCAT CGAGTGTTCG 1380 

GGGATCCACA GAGAGCTGGG CGTCCATTAC TCCCGAATCC AGTCCCTCAC ACTCGACGTC 1440 

CTCAGCACCT CCGAGCTCTT GCTGGCCAAG AACGTGGGGA ATGCTGGCTT CAATGAGATC 1500 

ATGGAGGCCT GTCTGACGGC AGAAGATGTG ATCAAAC CGA ATCCAGCCAG TGACATGCAG 1560 

GCGAGGAAGG ACTTTATCAT GGCCAAATAC ACAGAGAAAC GCTTCGCTCG TAAGAAGTGT 1620 

CCAGACGCAC TGTCGAAGCT GCACACGCTC TGTGATGCTG TGAAGGCCCG GGACATTTTC 1680 

TCTCTCATCC AGGTCTATGC TGAAGGAGTG GATCTGATGG AGCCCATTCC TCTGGCTAAT 1740 

GGACATGAAC AAGGTGAGAC GGCTCTTCAT CTGGCCGTGA GACTGGTGGA CAGAACTTCC 1800 

CTACACATCA TCGACTTCCT CACCCAAAAC AGTTTAAACC TGGATAAGCA AACGGCTAAA 1860 

GGAAGCACAG CTCTGCATTA CTGCTGCCTG ACGGACAACA GCGAGTGTCT CAAACTGCTG 1920 

CTCAGAGGAA AAGCCTCCAT AGATATCGCT AATGAAGCTG GAGAGACCCC GTTGGACATC 1980 

GCCAGGCGAC TCAAACATCT GCAGTGTGAG GAACTGCTGA ACCAGGCTCT TGCAGGGAAG 2040 
TTCAATGCTC ATGTGCATGT GGAGTATGAG TGGAGACTTC AGCATGAAGA CCTGGACGAG 2100 
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AGTGATGAAG ATCTGGATGA GAAGTCGAGT CCTCACCGGC GGGATGAGCG GCCCATCAGC 2160 

TGCTACACAC CGGGCAGTAA CTCCCTTCAG CTGAGTCCAG CCAGCCTGAG CCGAGACGGT 2220 

CGAGACCTGG TTAAAGACAA GCAACGCTTT GTGCCAAACC TGGTCAACAA TGAAACCTAC 2280 

GGGACCATCA TTAACACCAG CTCACCCGTC AGCCTGTCCT CTTCTGCTCC ACCTCTACCA 2340 

CCCCGAAACC TAGTTCAGCC GTCTGCTCTT GCAGGACTGA CTCAAGGATC TCCCGGCTGG 2400 

AAGCCTGGCT CTCTGGATCT GAGCGGCAGA CAGAGATCCT CCTCTGACCC TCCCAACATG 2460 

CATCCTCCTG CGCCTCCCTT ACGGGTCACT TCCACCTCCC TTCTAATGCC CAGCGGTGCT 2520 

GCTCCTCCTC TGGCTAAAGC TACTGGTATG ATGGAGACCA TGAATATGCA ACCCAAACCC 2580 

GGACAGGGGC CTCCTGGACA GAACATCAAC CGGGCTACAA GTGCGGACAA AAACTTCAGC 2640 

AAAAGCACAC TGATGCGCTC CGGATCCATC GAGAGACCAG CTAAAGAAGT CCCAGGAGGC 2700 

CCACAAAACA CCACTGGTCA AACTCTGCCT GCGACCCACA TGCCCAGGAA AACGTATTTG 2760 

AAGCCGAAGC GTGTGAAGGC CATGTATAAC TGTGTGGCCG ATAATCCAGA CGAGCTGACC 2820 

TTCTCTGAGG GAGAGCTTAT CGTGGTGGAT " GGAGAGGAGG ACCAGGAGTG GTGGCTGGGC 2880 

CACATTGAGG GAGAGCCAAT GAGAAGAGGA GCGTTTCCTG TCACGTTTGT ACAGTTCATT 2940 

ATGGACTGA 2949 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4595 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNES S : s ingl e 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 300.. 3008 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GGAGCTCGCG CGCCTGCAGG TCGACACTAG TGGATCCAAA GAATTCGGCA CGAGCAGAAG 60 

TGTTGATCTT GTCAGCTGCT CGTGTGATGG AGTTGTTTAA CGCTTGTGTT CAAAGGCAAA 120 

TCCTCTCCTC ATCGGCCGTT TACATTTTAA CTTCACGCGG AAATTTAAAA CTGAACTAAT 180 

CTCTAAGGAA TGACTGAAAT GGACTTGAGT TGAAGTCTGG TTTTTGAGCG CGAAGCTACA 240 
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ACTTTAAGCA AACTTTCTTT CTTTTTTGGA TCTATTGTGT AGATTTAAAA GG AATAATC 299 

ATG CCT GAT CAG CTG ACA GTG ACT GAG TTT GTG GAT ATT ACC CAT GAG 347 
Met Pro Asp Gin Leu Thr Val Thr Glu Phe Val Asp lie Thr His Glu 
15 10 15 

GAC TAT AAA GCA CCG ACA ACA TCA GTG TTC TGC ACG CGC ATG GCT CAC 395 
Asp Tyr Lys Ala Pro Thr Thr Ser Val Phe Cys Thr Arg Met Ala His 
20 25 30 

TGC AGG AAT ACA GTC GCC GCT CTG GAA GAG GCG CTG GAT CTG GAC CGC 443 
Cys Arg Asn Thr Val Ala Ala Leu Glu Glu Ala Leu Asp Leu Asp Arg 
35 40 45 

AGT GTA CTG CAC AAA ATG AAG AAG TCA GTC AAG GCC ATA AAC AGC TCT 491 
Ser Val Leu His Lys Met Lys Lys Ser Val Lys Ala lie Asn Ser Ser 
50 55 60 

GGT CAG ACT CAT GTA GAG AAC GAG GAG CAG TAC ATC CAG GCC ATA GAG 539 
Gly Gin Thr His Val Glu Asn Glu Glu Gin Tyr He Gin Ala He Glu 
65 70 75 80 

AGG TTT ACG GAT AAC ACT GTG TAC AAA GAT GAC CCT GAG ATG TCC AAT 587 
Arg Phe Thr Asp Asn Thr Val Tyr Lys Asp Asp Pro Glu Met Ser Asn 
85 90 95 

TAC TTC CTC ACA TTC GCT GGT TTC ACC AAG GAG CTT ACT GCT CTT TTC 635 
Tyr Phe Leu Thr Phe Ala Gly Phe Thr Lys Glu Leu Thr Ala Leu Phe 
' 100 105 HO 

AAG AAC TTG CTA CAG AAC ATG AAT AAC ATC ATC ACT TTT CCA CTA GAC 683 
Lys Asn Leu Leu Gin Asn Met Asn Asn He lie Thr Phe Pro Leu Asp 
115 120 125 

AGT CTG CTA AAG GGA GAC CTC AAA GGA GTC AAA GGG GAT TTG AAA AAG 731 
Ser Leu Leu Lys Gly Asp Leu Lys Gly Val Lys Gly Asp Leu Lys Lys 
130 135 140 

CCA TTT GAT AAA GCA TGG AAG GAT TAT GAA ACC AAA CTG AGC AAG ATT 779 
Pro Phe Asp Lys Ala Trp Lys Asp Tyr Glu Thr Lys Leu Ser Lys He 
145 150 155 160 

GAG AAA GAA AAG CGA GAA CAT GCC AAA CAG CAC GGT CTG ATC CGA ACA 827 
Glu Lys Glu Lys Arg Glu His Ala Lys Gin His Gly Leu He Arg Thr 
165 170 175 

GAG ATC AGT GGA GGA GAG ATC GCA GAA GAG ATG GAG AAA GAG AGA CGC 875 
Glu He Ser Gly Gly Glu He Ala Glu Glu Met Glu Lys Glu Arg Arg 
180 185 190 



CTG TTT. CAG CTT CAG ATG TGT GAG TAC CTC ATT AAA GTG AAT GAA ATC 
Leu Phe Gin Leu Gin Met Cys Glu Tyr Leu He Lys Val Asn Glu He 
195 200 205 



923 
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AAA GTC AAA AAG GGG GTC GAC CTG CTT CAC AAC CTC ATC AAA TAC TTT 971 
Lys Val Lys Lys Gly Val Asp Leu Leu His Asn Leu lie Lys Tyr Phe 
210 215 220 

CAT GCC CAG TGC AAT TTC TTT CAG GAT GGG CTA AAG GTC GTG GAC AAT 1019 
His Ala Gin Cys Asn Phe Phe Gin Asp Gly Leu Lys Val Val Asp Asn 
225 230 235 240 

CTG AAA CCT TTC ATG GAA AAG CTT GCC ACA GAC TTA ACC GCG AAC AAA 1067 
Leu Lys Pro Phe Met Glu Lys Leu Ala Thr Asp Leu Thr Ala Asn Lys 
245 250 255 

CAG ACT CAA GAT GCA GAA AGG AAA CAG TTG CTG CAG CTG AAA GAA ACT 1115 
Gin Thr Gin Asp Ala Glu Arg Lys Gin Leu Leu Gin Leu Lys Glu Thr 
260 265 270 

CTT AAA TCT GCT CTA CAG TCT GAG TGT AAG GAG GAT GCT CAG TCA AAG 1163 
Leu Lys Ser Ala Leu Gin Ser Glu Cys Lys Glu Asp Ala Gin Ser Lys 
275 280 2B5 

CAG AAC GCA GGC TAC AGT CTT CAC CAG TTG CAG GGC AAT AAA GCT CAC 1211 
Gin Asn Ala Gly Tyr Ser Leu His Gin Leu Gin Gly Asn Lys Ala His 
290 295 300 

GGC ACG GAG CGC TCT GGG ATG CTC CTC AAA CGC AGC GAG GGA CTG AGG 1259 
Gly Thr Glu Arg Ser Gly Met Leu Leu Lys Arg Ser Glu Gly Leu Arg 
305 310 315 320 

AAA GTT TGG CAG AAA AGG AAG TGC TCT GTG AAA AAT GGA TTG TTG ACT 1307 
Lys Val Trp Gin Lys Arg Lys Cys Ser Val Lys Asn Gly Leu Leu Thr 
325 330 335 

ATT TCA CAT GGA ACG CCC AAT GCA CCG CCA GCA AAC CTG AAC CTC TTA 1355 
lie Ser His Gly Thr Pro Asn Ala Pro Pro Ala Asn Leu Asn Leu Leu 
340 345 350 

ACC TGC CAA GTG AAG CGT AAC CCA GAT GAG AAA AAA TGC TTT GAT CTC 1403 
Thr Cys Gin Val Lys Arg Asn Pro Asp Glu Lys Lys Cys Phe Asp Leu 
355 360 365 

ATA TCA CAT GAC AGA ACG TAT CAC TTC CAG ACT GAG GAT GAG GCA GAG 1451 
lie Ser His Asp Arg Thr Tyr His Phe Gin Thr Glu Asp Glu Ala Glu 
370 375 380 

TGT CAG GTA TGG GTT TCT GTT CTC CAG AAC AGT AAA GAA GAG GCG CTG 1499 
Cys Gin Val Trp Val Ser Val Leu Gin Asn Ser Lys Glu Glu Ala Leu 
385 390 395 400 

AAC AAT GCC TTT AAA GAC GAT CAG AAT GAG GGA GAA AAT AAC ATT GTT 1547 
Asn Asn Ala Phe Lys Asp Asp Gin Asn Glu Gly Glu Asn Asn lie Val 
405 410 415 

CGA GAG CTC ACT AAG GCC ATC GTG GGG GAA GTG AAG AAA ATG AGC GGC 1595 
Arg Glu Leu Thr Lys Ala lie Val Gly Glu Val Lys Lys Met Ser Gly 
420 425 430 
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AAT GAC GTG TGC TGT GAC TGT GGA GCT TCC AAT CCA ACA TGG CTC TCC 1643 
Asn Asp Val Cys Cys Asp Cys Gly Ala Ser Asn Pro Thr Trp Leu Ser . 
435 440 445 

ACA AAC CTG GGT GTG TTG ATT TGC ATT GAA TGC TCT GGG ATC CAT CGG 1691 
Thr Asn Leu Gly Val Leu He Cys He Glu Cys Ser Gly He His Arg 
450 455 460 

GAA ATG GGC GTC CAC TAC TCC CGA ATA CAG TCT CTG ACA CTG GAC CTC 1739 
Glu Met Gly Val His Tyr Ser Arg He Gin Ser Leu Thr Leu Asp Leu 
465 470 475 480 

TTA GGC ACA TCT GAA CTA TTG CTT GCT AAC AGT GTG GGA AAT GCA GCA 1787 
Leu Gly Thr Ser Glu Leu Leu Leu Ala Asn Ser Val Gly Asn Ala Ala 
485 490 495 

TTC AAT GAA ATC ATG GAA GCA AAA CTG TCT TCA GAG ATC CCA AAA CCC 1835 
Phe Asn Glu He Met Glu Ala Lys Leu Ser Ser Glu He Pro Lys Pro 
500 505 510 

TAC CCT TCT AGT GAC ATG CAG GTA CGA AAA GAC TTC ATC ACA GCC AAA 1883 
Tyr Pro Ser Ser Asp Met Gin Val Arg Lys Asp Phe He Thr Ala Lys 
515 520 525 

TAC ACA GAG AAG CGT TTC GCT CAG AAG AAG TAT GCA GAT AAC GCA GCT 1931 
Tyr Thr Glu Lys Arg Phe Ala Gin Lys Lys Tyr Ala Asp Asn Ala Ala 
530 535 540 

CGA CTG CAT GCA CTG TGT GAT GCA GTG AAG TCT CGG GAC ATC TTC TCC 1979 
Arg Leu His Ala Leu Cys Asp Ala Val Lys Ser Arg Asp He Phe Ser 
545 550 555 560 

CTG ATC CAG GTC TAT GCT GAA GGA CTG GAC CTG ATG GAG ACC ATT AAT 2027 
Leu He Gin Val Tyr Ala Glu Gly Leu Asp Leu Met Glu Thr He Asn 
565 570 575 

CAG CCT AAC CAA CAT GAA CCA GGC GAG ACA TCA CTA CAT CTT GCG GTA 2075 
Gin Pro Asn Gin His Glu Pro Gly Glu Thr Ser Leu His Leu Ala Val 
580 585 590 

CGA ATG GTG GAC CGA AAC TCC CTC CAT ATT GTG GAC TTT CTT GTA CAG 2123 
Arg Met Val Asp Arg Asn Ser Leu His He Val Asp Phe Leu Val Gin 
595 600 605 

AAC AGT GGC AAT TTA GAC AAG CAG ACA GCC AAA GGA AGC ACA GCG CTA 2171 
Asn Ser Gly Asn Leu Asp Lys Gin Thr Ala Lys Gly Ser Thr Ala Leu 
610 615 620 

CAT TAT TGC TGC TTG ACT GAT AAC AGT ' GAA TGT ATG AAG CTG CTG CTG 2219 
His Tyr Cys Cys Leu Thr Asp Asn Ser Glu Cys Met Lys Leu Leu Leu 
625 630 635 640 
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CGG GGG AAA GCA TCT GTC AGC ATT ACT AAT GAT GCT GGA GAG ACT GCT 2267 

Arg Gly Lys Ala Ser Val Ser lie Thr Asn Asp Ala Gly Glu Thr Ala 
645 650 655 

CTG GAT TTG GCG CAG CGT CTC AAA CAC TCC AAA TGC GAG GAG CTG CTG 2315 
Leu Asp Leu Ala Gin Arg Leu Lys His Ser Lys Cys Glu Glu Leu Leu 
660 665 670 

ACT CAG GCG CAG ACG GGG AAG TTC AAT GTC CAT GTG CAT GTG GAA TAT 2363 
Thr Gin Ala Gin Thr Gly Lys Phe Asn Val His Val His Val Glu Tyr 
675 680 685 

GAC TGG CGT CTG CAT AAT GAG GAT CTG GAC GAG AGC GAA GAT GAG ATG 2411 
Asp Trp Arg Leu His Asn Glu Asp Leu Asp Glu Ser Glu Asp Glu Met 
690 695 700 

GAG GAC AAG CCC ATT CCC ATC AGG CGT GAG GAG CGT CCA ATA AGC TGT 2459 
Glu Asp Lys Pro lie Pro lie Arg Arg Glu Glu Arg Pro lie Ser Cys 
705 710 715 720 

ATA GTT CCA GGC AGT GGC CCC ATG ATG CCC AAC ATG AGC GCT CTG GCT 2507 
lie Val Pro Gly Ser Gly Pro Met Met Pro Asn Met Ser Ala Leu Ala 
725 730 735 

CGG GAC GTG GCC AAT GTG GTC AAT AAT AAG CAG AGG GCT TTT ATT CCG 2555 
Arg Asp Val Ala Asn Val Val Asn Asn Lys Gin Arg Ala Phe lie Pro 
740 745 750 

AGC ATG ATG ATG AAC GAG ACT TAC GGC ACC ATG CTC GAT CCC AAC TCT 2603 
Ser Met Met Met Asn Glu Thr Tyr Gly Thr Met Leu Asp Pro Asn Ser 
755 760 765 

CCA CCA CTG GGT TTA CCA GGA GTA CCT GGC ATT CCT CTT TTA CCC CCT 2651 
Pro Pro Leu Gly Leu Pro Gly Val Pro Gly lie Pro Leu Leu Pro Pro 
770 775 780 

CGG CCC TTG GGA AGG GGA TGG AGT CCA CCA ATG GAG AAC ATC GGT AGA 2699 
Arg Pro Leu Gly Arg Gly Trp Ser Pro Pro Met Glu Asn lie Gly Arg 
785 790 795 800 

CAG AGG TCA TGT TCA GAT CCT GCA AAC CCT CAA ACT CCT GAA CAA AAT 2747 
Gin Arg Ser Cys Ser Asp Pro Ala Asn Pro Gin Thr Pro Glu Gin Asn 
805 810 815 

AAC TCT GTG TAT GTT CTG CCT CCT GCT CCT CCA CCT CCT CCT GCA CCC 2795 
Asn Ser Val Tyr Val Leu Pro Pro Ala Pro Pro Pro Pro Pro Ala Pro 
820 825 830 

AAG AGA CCT CCA CCT CCA GAT CCA AAG GCC AGT CTT CTT CCT CCA GCA 2843 
Lys Arg Pro Pro Pro Pro Asp Pro Lys Ala Ser Leu Leu Pro Pro Ala 
835 840 845 

GCC ACG GCT CCT CCT GCA CCA TCC GCA CCG CTC CTT ATT CCA CCT GCT 2891 
Ala Thr Ala Pro Pro Ala Pro Ser Ala Pro Leu Leu lie Pro Pro Ala 
650 855 860 
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CCT CTC AGG CCA GCG CCT GTA GTG CCC CCT GCA CCA GTT ATG CCC ACT 2939 
Pro Leu Arg Pro Ala Pro Val Val Pro Pro Ala Pro Val Met Pro Thr 
865 870 875 880 

TCG TCA CTG ACT GAT GTC AAA AGT CTG CTG TCT AAA GCC GAG CTC ACA 2987 
Ser Ser Leu Thr Asp Val Lys Ser Leu Leu Ser Lys Ala Gin Leu Thr 
885 890 895 

TTG TGC GAT TTC GAA TAC TAC TAAATGATTG TAGCATCAGA GTGCACAAGT 3038 
Leu Cys Asp Phe Glu Tyr Tyr 
900 

ATGATCCGCA TGTGTCCCTC AGTTTTCATA ATGTCAGATT GAACCACAGT TAAGATGCAC 3098 

CAAACATGGA CACGCAAGAA AACTCACCCT GGAGTTTGGC ATCATCCATC TGTGACACCT 3158 

TCACTCTACT GCATCCTGAC ATGAAACCTC ACGGTAAACA TAAACAAACT GTAGCAACAC 3218 

TTTTACTTAC AACACGTCTC AGTGATAACC GGAAAAGGCA GTGGTTTGAA AGTGTCGTTC 3278 

TGATTGCGTC ATCAGATATA CCGCTCCTAT TGATTCTTGG TTAGACGCTC GTCTTAACTG 3338 

AATTCACACT TCAGCCAAGA GTCTGAACGC CCGACACCAC CAGAACTTCT TCATCAGAGG 3398 

GAAAATCTGA TCGTAGAGGC CATCAATCAA GGAATCAAAA ACTACAGATT TTAGGCTAGG 3458 

ATTACTGGAA TCTTTTAGGA TTTTCCATAT TAGTCTCAGA TGGCCAAATC ATCTCTGAAA 3518 

TTGCACAGTG TGAGCAGGGC TTAAATCAGA TCACCAAACT ATTGTTGAGA CCTAACACCA 3578 

CTGAATATTT AACAATCAAT ACACCCCTCA GCCATCCGTG TGGCTAATTG GTGGTGTACG 3638 

AGACATTCAC AAGCATTAAG ACCTCAGGAA GTGTTACTTT GATTACTTTG ATTCTAAGTG 3698 

CAATTACCTC TACCTTTAAT ACGGAAATCG TTTATGAACT GTGATGAGTG ATATGCATTA 3758 

TACGGGGACG GTTTGGTTTT ATTAAGCGAG ATGTGGTTGG ATGAGCTTTT TGTGTTTTTC 3818 

AGACAGCAGT GGCAGAGTGA CTCCTATTTG GCAAGTGTTT AAAGGCACAA TATGTAATAT 3878 

TCACCACAAG GGGGCACATA TTCACAACAA ACAAATGGTT ATGTCTGTTA GGGTGCTGCA 3938 

CTTTGCAGTG TAATAAAACG CACAACATTT TAAAGCGTCT TTGGAGTTTT TCTGTTTTCT 3998 

AGAAAACCAA ACTAGAAATC GAAGGTGATG AGCAACTGGA AAATGCAGGT GTATGATGTC 4058 

ATAAGCATGG AGACACTAGT TAAAATAACT TATATCTCTG GATTTGAACA TTCTTCCTAA 4118 

CCTTTGGGAT AATGCAAGTA CTCAAGCCAA AATATATCAC ACTGTTTTAG TGATTTTAGG 4178 

ATATTTGAAA GAAAATAATC GTACATATTG TGCCTTTAAG TAACATGATG AACCAGGTAG 4238 

GTTGCTTCTC AAGATTTGTT ACCAGACAAG CCATTAAACT TACTCTGCTT CATTTTCAGC 4298 
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